All in Strings: a Powerful String-based Automatic MT Evaluation Metric with Multiple Granularities

نویسندگان

  • Junguo Zhu
  • Muyun Yang
  • Bo Wang
  • Sheng Li
  • Tiejun Zhao
چکیده

String-based metrics of automatic machine translation (MT) evaluation are widely applied in MT research. Meanwhile, some linguistic motivated metrics have been suggested to improve the string-based metrics in sentencelevel evaluation. In this work, we attempt to change their original calculation units (granularities) of string-based metrics to generate new features. We then propose a powerful string-based automatic MT evaluation metric, combining all the features with various granularities based on SVM rank and regression models. The experimental results show that i) the new features with various granularities can contribute to the automatic evaluation of translation quality; ii) our proposed string-based metrics with multiple granularities based on SVM regression model can achieve higher correlations with human assessments than the stateof-art automatic metrics.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Normalized Compression Distance as automatic MT evaluation metric

This paper evaluates a new automatic MT evaluation metric, Normalized Compression Distance (NCD), which is a general tool for measuring similarities between binary strings. We provide system-level correlations and sentence-level consistencies to human judgements and comparison to other automatic measures with the WMT’08 dataset. The results show that the general NCD metric is at the same level ...

متن کامل

A Fast and Accurate Global Maximum Power Point Tracking Method for Solar Strings under Partial Shading Conditions

This paper presents a model-based approach for the global maximum power point (GMPP) tracking of solar strings under partial shading conditions. In the proposed method, the GMPP voltage is estimated without any need to solve numerically the implicit and nonlinear equations of the photovoltaic (PV) string model. In contrast to the existing methods in which first the locations of all the local pe...

متن کامل

Normalized Compression Distance Based Measures for MetricsMATR 2010

We present the MT-NCD and MT-mNCD machine translation evaluation metrics as submission to the machine translation evaluation shared task (MetricsMATR 2010). The metrics are based on normalized compression distance (NCD), a general information theoretic measure of string similarity, and evaluated against human judgments from the WMT08 shared task. The experiments show that 1) our metric improves...

متن کامل

The Role of Pseudo References in MT Evaluation

Previous studies have shown automatic evaluation metrics to be more reliable when compared against many human translations. However, multiple human references may not always be available. It is common that automatic metrics must make judgments based on a single human reference (extracted from parallel texts) or no reference at all. Our earlier work suggested that a promising way to address this...

متن کامل

Effects of Disc Insulator Type and Corona Ring on Electric Field and Voltage Distribution over 230-kV Insulator String by Numerical Method

Insulator strings with several material and profiles are very common in overhead transmission lines. However, the electric field and voltage distribution of insulator string is uneven which may easily lead to corona, insulators’ surface deterioration and even flashover. So the calculation of the electric field and voltage distribution along them is a very important factor in the operation time....

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010